A Multi-Agent System for Distributed Information Retrieval on the World Wide Web
نویسندگان
چکیده
In this paper a novel approach concerned with the general framework of Information Management, is presented. We use a Multi-Agent System to cope with the problem of Distributed Information Retrieval. The Distributed Information Retrieval task deals with the collection of information from multiple and usually heterogeneous information sources that exist in a distributed environment, which in our case is the World Wide Web. 1 Overview of the system's architecture The advent of large wide-area networks, Internet is the most characteristic example, has caused a vast increase both in the information availability and in the number of the information sources. This evolution offers great promise for obtaining and sharing diverse information conveniently. However, the multitude, diversity and the dynamic nature of on-line information sources make accessing any specific piece of information an extremely difficult task. One way to address these issues is to use information agents. These Distributed Information Retrieval agents should be able to: accept a request from a human or agent client, translate this request into a language understood by the information sources, identify the information sources that contain information relevant to the request, pose the request to these sources, collect the corresponding results, process the returned results and present the results to the client. We have followed this approach in developing our information retrieval system for the WWW. The overall agent architecture is as follows (see Figure 1). The inter-agent communication is based on standard Knowledge Query Manipulation Language (KQML) performatives [Patil 94]. Our system supports a collection of information sites. The notion of an information site is used to describe a logical entity that contains a set of information sources. It is a logical clustering of actual-physical WWW sites. In each information site, we find the extractor agent and the information source agent. The extractor periodically scans through all the information sources, represented as URLs. These can be URLs of the top-level web pages of various research groups, for example. The extractor traverses through all the local documents (e.g. documents belonging to that research group) that are accessible via a chain of links from the top-level page. It classifies each such page as 'interesting' or not and extracts from each 'interesting' web page the key features and represents these features in a relational/attribute-based form. For example, it will describe an identified research paper in terms of attributes like authors, title, topics, keywords, document location (URL), abstract of document location (URL) and referenced authors.
منابع مشابه
Load-Frequency Control: a GA based Bayesian Networks Multi-agent System
Bayesian Networks (BN) provides a robust probabilistic method of reasoning under uncertainty. They have been successfully applied in a variety of real-world tasks but they have received little attention in the area of load-frequency control (LFC). In practice, LFC systems use proportional-integral controllers. However since these controllers are designed using a linear model, the nonlinearities...
متن کاملInformation Retrieval in a Cloud using Ontologies and Multi-Agent System
The existence of heterogeneous systems in the now existing global village brings about a lot of complexity in the search and retrieval of information. However, continuous changing fields of knowledge offer a number of standard procedures and methods made possible by the up spring of technology to help users solve their queries. This research examines the application of knowledge management proc...
متن کاملAn Agent-Oriented Personalized Web Searching System
Web retrieval is now one of the most important issues in computer science, and we believe that applying multi-agent systems to this area is a promising approach. We introduce Kodama1 system, which is being developed and in use at Kyushu University, as a multi-agent-based approach to build a distributed Information Retrieval (IR) system that lets users retrieve relevant distributed information f...
متن کاملReducing Retrieval Time in Automated Storage and Retrieval System with a Gravitational Conveyor Based on Multi-Agent Systems
The main objective of this study is to reduce the retrieval time of a list of products by choosing the best combination of storage and retrieval rules at any time. This is why we start by implementing some storage rules in an Automated Storage/Retrieval System (Automated Storage and Retrieval System: AS/RS) fitted with a gravity conveyor while some of these rules are dedicated to storage and ot...
متن کاملImprovement of Web-based service information systems using fuzzy linguistic techniques and Semantic Web technologies
The aim of this paper is to present a model of a web multi-agent system which combines the use of Semantic Web technologies together with the application of user profiles to provide an enhanced Web retrieval service. This system uses fuzzy linguistic techniques to deal with qualitative information in a userfriendly way. The system activity is developed in two phases: retrieval phase to gather t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1997